# High-Fidelity Speech Synthesis

Parler TTS Large V1

A 2.2-billion-parameter text-to-speech model trained on 45,000 hours of audio data. It supports controlling voice characteristics through text prompts.

Speech Synthesis

Transformers English

Vocos Mel Hifigan Compat 44100khz

Vocos is a fast neural vocoder that achieves efficient audio reconstruction by generating spectral coefficients, particularly suitable for text-to-speech tasks.

Speech Synthesis

TensorBoard Other

A Japanese text-to-speech (TTS) model using the VITS architecture, trained by mio with the ESPnet2 framework on the amadeus dataset.

Speech Synthesis Japanese

Gunnarthor Talromur A FastSpeech2

A FastSpeech2 text-to-speech model for Icelandic speech synthesis, trained with the ESPnet framework on the Talromur dataset.

Speech Synthesis Icelandic

© 2025 AIbase